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SHORT INTERFERING RNA AND MICRO-RNA COMPOUNDS 
AND METHODS OF DESIGNING, MAKING AND USING THE SAME 

FIELD OF THE INVENTION 

The present invention relates to siRNA and miRNA compounds and methods 
of designing, synthesizing and using them to inhibit production of protein by 
inhibiting mRNA translation, 

BACKGROUND OF THE INVENTION 

An evolutionary conserved mechanism of gene expression regulation, based 
on small, noncoding RNAs, exists in diverse organisms from Schizosaccharomyces 
pombe (1) to humans (2) (3). A class of ~22 nucleotide (nt) small RNAs, termed 
short interfering RNAs (siRNAs) (4) (5) are critical determinants of RNA interference 
(RNAi), an RNA-based silencing mechanism (6), whose physiological fonction 
includes protecting the genome against invading nucleic acids such as viruses and 
transposons (7) (S). 

In RNAi, double-stranded RNA (dsRNA) is converted by Dicer, an RNAse 
Hi-type nuclease (9), into siRNA duplexes that incorporate in ribonucleoprotein 
complexes (RNPs) termed RNA-induced silencing complexes (RISCs) (10). 
Unwinding of siRNA duplexes activates RISC and enables it to recognize target 
RNAs that are complementary to the guide strand of the siRNA (77). An unknown 
endonuclease(s) present in RISC cleaves the target RNA across from the center of the 
guide siRNA (5) (77). 

Another class of small ~22nt RNAs, termed microRNAs (miRNAs) (2) (72) 
(75) (3) are processed by Dicer (14) (15) (16) from the stems of longer (~75nt) RNA 
precursors (pre-miRNAs) that form stem-loop structures. miRNAs incorporate in 
microRNPs (miRNPs) along with at least three proteins: eukaryotic translation 
Initiation Factor 2C2 (eIF2C2), a member of the Argonaute family of proteins; 
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Gemin3, a putative RNA helicase; and Gemin4 (3). Animal miRNAs recognize 
sequences with partial antisense complementarity in the 3' untranslated regions of 
their target mRNAs and prevent the accumulation of nascent polypeptides (77) (75) 
(79) (20). 

Argonaute proteins (27) are essential for RNAi and play critical roles in the 
maturation and function of si/miRNAs (22) (23) (24) (25) and are also present in 
Drosophila (10) and human RISC complexes (26); their biochemical functions 
however, are unknown. Hutvagner and Zamore have recently made the important 
observation that RISC and miRNPs are very similar complexes because 
immunopurified miRNPs containing let-7, an endogenous miRNA, are able to cleave 
a target RNA that contains a fully complementary site to let-7 (27). Furthermore, 
plant miRNAs with extensive complementarity to their target mRNAs act as siRNAs 
by cleaving their mRNA targets (28) (29) (30). These findings indicate that a 
miRNA-directed endonuclease, similar to the siRNA-directed RISC endbnuclease, is 
present in miRNPs. 

Since the discovery of RNAi, much effort has been directed toward using 
RNAi to control gene expression, such as in the treatment of disease and infection. 
For example, RNAi is proposed for use in treatment of cancer by post transcriptional 
inhibition of oncogene expression. Other targets for RNAi post translational 
inhibition of gene expression include other disease causing genes or genes whose 
expression causes undesirable effects or symptoms. RNAi has also been proposed to 
inhibit gene expression of pathogen genes such as viral genes. 

To use RNAi in the treatment of disease and infection, siRNA or miRNA are 
designed based upon the nucleotide sequence of the mRNA that is targeted for post 
transcriptional inhibition of expression. An siRNA or miRNA so designed may not 
only inhibit the targeted mRNA but other mRNA as well. This lack of specificity can 
cause unwanted side effects by inhibiting expression of genes whose expression is 
necessary. 

There remains a need for improvements in the methods of designing, 
synthesizing and use of siRNAs and miRNAs. There is a need for methods of 
determining whether an siRNA or miRNA will be target specific. There is a need for 
improved siRNAs and miRNAs. 

SUMMARY OF THE INVENTION 
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The present invention relates to methods of identifying a uniquely targeting 
siRNA nucleotide sequence for a target mRNA sequence of a target species. The 
methods comprise the steps of: comparing a database of mRNA sequences from the 
target species w^ith an siRNA nucleotide sequence that consists of 18-25 including at 
least 1 1 consecutive nucleotides complementary to the target mRNA sequence to be 
cleaved by the siRNA nucleotides, wherein the at least 1 1 consecutive nucleotides 
complementary to the target mRNA sequence include a nucleotide that is third from 
an siRNA nucleotide sequence's 5' end; and determining if, in addition to the target 
mRNA sequence, one or more additional mRNA sequences in the database are 
complementary to an 11 consecutive nucleotide sequence of the siRNA nucleotide 
sequence including the third nucleotide from the 5' end of the siRNA nucleotide. An 
absence of one or more additional mRNA sequences in the database that are 
complementary to an 11 consecutive nucleotide sequence of the siRNA nucleotide 
sequence including the third nucleotide from the 5' end of the siRNA nucleotide 
indicates that the siRNA nucleotide sequence is a uniquely targeting siRNA 
nucleotide sequence. 

The present invention further relates to methods of designing a uniquely 
targeting siRNA for a target mRNA molecule. The methods comprise the steps of 
identifying an siRNA nucleotide sequence for the target mRNA, said sequence 
consisting of 18-24 nucleotides including a nucleotide sequence that has 1 1 
consecutive nucleotides, including the third nucleotide from the siRNA nucleotide 
sequence's 5' end, that are complementary to an 1 1 nucleotide sequence that occurs 
on the target mRNA molecule; comparing the siRNA nucleotide sequence with a 
database of mRNA sequences from the target mRNA species; and determining if, in 
addition to the target mRNA sequence, one or more additional mRNA sequences in 
the database are complementary to an 1 1 consecutive nucleotide sequence of the 
siRNA nucleotide sequence including the third nucleotide from the 5' end of the 
siRNA nucleotide. An absence of one or more additional mRNA sequences in the 
database that are complementary to an 1 1 consecutive nucleotide sequence of the 
siRNA nucleotide sequence including the third nucleotide from the 5' end of the 
siRNA nucleotide indicates that the of the siRNA nucleotide is a uniquely targeting 
siRNA nucleotide sequence. 

The present invention further relates to methods of synthesizing a uniquely 
targeting siRNA for a target mRNA molecule. The methods comprise the steps of: 
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identifying or designing uniquely targeting siRNA nucleotide sequence for the target 
mRNA according to any of claims 1 to 12; and synthesizing an siRNA molecule 
having the uniquely targeting siRNA nucleotide sequence. 

The present invention further relates to a method of inhibiting expression of a 
target mRNA molecule comprising the steps of: synthesizing a uniquely targeting 
siRNA for a target mRNA molecule and contacting a target mRNA molecule with an 
RISC that comprises an siRNA molecule that comprises the uniquely targeting siRNA 
nucleotide sequence. 

The present invention further relates to methods of identifying an miRNA 
nucleotide sequence that does not function as a siRNA nucleotide sequence for 
mRNA of a target species comprising the steps of: comparing a database of mRNA 
sequences from the target species with an miRNA nucleotide sequence that consists of 
18=24 nucleotides; and determining if one or more mRNA sequences in the database 
are complementary to em 1 1 consecutive nucleotide sequence of the miRNA 
nucleotide sequence including the third nucleotide from the 5' end of the miRNA 
nucleotide. An absence of one or more mRNA sequences in the database that are 
complementary to an 1 1 consecutive nucleotide sequence of the miRNA nucleotide 
sequence including the third nucleotide from the 5' end of the miRNA nucleotide 
indicates that the miRNA nucleotide sequence does not function as a siRNA 
nucleotide sequence for mRNA of a target species. 

The present invention further relates to methods of designing an miRNA 
nucleotide sequence that does not function as a siRNA nucleotide sequence for 
mRNA of a target species. The methods comprises the steps of: identifying an 
miRNA nucleotide sequence that consists of 1 8-24 nucleotides; comparing the 
miRNA nucleotide sequence with a database of mRNA sequences from the target 
mRNA species; and determining if one or more mRNA sequences in the database are 
complementary to an 1 1 consecutive nucleotide sequence of the miRNA nucleotide 
sequence including the third nucleotide from the 5' end of the miRNA nucleotide. An 
absence of one or more mRNA sequences in the database that are complementary to 
an 1 1 consecutive nucleotide sequence of the miRNA nucleotide sequence including 
the third nucleotide from the 5' end of the siRNA nucleotide indicates that the 
miRNA nucleotide sequence does not function as a siRNA nucleotide sequence for 
mRNA of a target species. 



-4- 



WO 2005/042705 



PCT7US2004/035124 



The present invention further relates to methods of synthesizing an miRNA 
that does not function as a siRNA nucleotide sequence for mRNA of a target species. 
The methods comprise the steps of: identifying or designing an miRNA nucleotide 
sequence that does not function as an siRNA nucleotide sequence for mRNA of a 
target species according to any of claims 15 to 26; and synthesizing an miRNA 
molecule that has an miRNA nucleotide sequence that does not function as an siRNA 
nucleotide sequence for mRNA of a target species. 

The present invention further relates to methods of inhibiting expression of a 
target mRNA molecule. The methods comprise the steps of: synthesizing an miRNA 
that does not function as a siRNA nucleotide sequence for mRNA of a target species 
and contacting a target mRNA molecule with an miRNP that comprises an miRNA 
that does not function as a siRNA nucleotide sequence for mRNA of a target species. 

The present invention farther relates to uniquely targeting siRNA molecule 
consisting of 18-25 nucleotides including a nucleotide sequence that has at least 1 1 
consecutive nucleotides, including the third nucleotide from the 5' end of the siRNA 
molecule, that are complementary to a nucleotide sequence that occurs on a target 
mRNA molecule and not on another mRNA from a target species and RISCs 
comprising same. 

The present invention further relates to siRNA molecules consisting of 1 8-25 
nucleotides including a nucleotide sequence that has at least 1 1 consecutive 
nucleotides, including the third nucleotide from the 5' end of the siRNA molecule, 
that are complementary to a nucleotide sequence that occurs on a target mRNA 
molecule and not on another mRNA from a target species, wherein the siRNA 
molecule is not fully complementary with a sequence of the target mRNA molecule 
and RISCs comprising same. 

The present invention further relates to miRNA molecules for a target mRNA 
of a target species, wherein said miRNA molecule consists of 18-25 nucleotides and 
does not function as a siRNA nucleotide sequence for mRNA of a target species. 

BRIEF DESCRIPTION OF THE FIGURES 

Figures 1 A-1D show data from the characterization of purified eIF2C2 protein 
and associated miRNAs. In Figure 1 A, aliquots from indicated eluates were resolved 
on a 4-12% NuPAGE gel and visualized by silver staining. Arrow indicates 0.4jig of 
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purified human eIF2C2 protein. Molecular mass markers (in kilodaltons) is shown on 
the left. In Figure IB, RNA was isolated from total HeLa cell lysate (T), from 8C7 
eluate containing lOOng of immunopurified eBF2C2 protein and from an equivalent 
volume of control IgG eluate; the RNA was 3'-end labeled with [5'- 32 P]-pCp and was 
resolved by electrophoresis on a 10% denaturing polyacrylamide gel. Nucleotide 
sizes of the radiolabeled marker (M) are shown on the left. In Figure 1C, RNA 
isolated from 8C7 eluate, containing 5 pmoles of purified eIF2C2, along with the 
indicated amounts of a synthetic let-la RNA, was analyzed by Northern hybridization 
using a radiolabeled oligonucleotide probe complementary to let-7a. In Figure ID, 
quantitation of C; black circles: synthetic let-7; open square: let-7 from 8C7 eluate. 

Figures 2A-2C show that the eIF2C2/fe*7a-miRNA ribonucleoprotein 
(miRgonaute) is a miRNP/RISC endonuclease. Figure 2A shows a schematic of the 
RNA targets used (designated let7TI and let7-TP) and the potential base pairing with 
let-7 a\ the [5'- 32 P] of pCp is shown. Cleavage sites are indicated with lightning bolt. 
In Figure 2B, the indicated, 3 '-end radiolabeled RNA targets were incubated with 
purified fef-7-containing eIF2C2 (eIF2C2/fe/-7) or with IgG eluate. The products of 
the reactions were analyzed on 15% denaturing polyacrylamide gels. In Figure 2C, 
3 '-end labeled let7-TP RNA target was incubated with purified eEF2C2//etf-7a for the 
times indicated either with or without ATP or Mg** ions (as shown), and were 
analyzed as in Figure 2B. Nucleotide sizes of the radiolabeled marker (M) are shown 
on the left. The RNA targets are indicated by brackets and the 3 '-cleavage products 
by arrows. 

Figures 3 A-3F relate to target RNA-miRNA pairing requirements for 
miRgonaute mediated cleavage. Figures 3A, 3C and 3E show schematic of the RNA 
targets used (designated let7TA, Iet7-TB, let7TC and Iet7-TD) and the potential base 
pairing with let-7a; the 32 P radiolabel is shown as p. Lightning bolts indicate cleavage 
sites. In Figures 3B, 3D and 3F, the RNA targets were incubated with (+) or without 
(-) purified fe^7a-containing-eIF2C2 (eIF2C2//e/- 7a) and the products of the reaction 
were analyzed on a 20% denaturing polyacrylamide gel. The cleavage products are 
indicated with arrows. 

Figures 4A and 4B shows the minimal requirements for target RNA 
recognition and cleavage by mi/siRNAs. Figure 4A shows a schematic of a 
miRgonaute ribonucleoprotein bound to its cognate mRNA target. miRNA 
nucleotides that are critical for target recognition and cleavage are shown base paired 
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with the target RNA nucleotides. Other, as yet unidentified, factor(s) (Factor X) 
might also contribute to target RNA recognition and cleavage, in vivo. Figure 4B 
shows a schematic of a mi/siRNA bound to its target. The stretch of the critical 
nucleotides of an mi/siRNA required for target RNA recognition and cleavage are 
indicated (nucleotides between the two arrows). To design siRNAs with high 
specificity these nucleotides should not base pair with any mRNA but the one to be 
targeted. Cleavage site of the target RNA is indicated with a lightning bolt. 

DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS 

The discovery that less than complete complementarity is required between an 
siRNA sequence and target RNA in order for RISC to function provides the 
opportunity to undertake several procedures to improve applications of RNAi 
technology as well as providing the basis for uniquely targeted siRNA compounds. 
Improved applications include 1) methods of determining whether or not an siRNA 
sequence uniquely targets a target RNA sequence, 2) improved methods of designing 
siRNA sequences, 3) improved methods of synthesizing siRNA sequences, and 4) 
improved methods of inhibiting gene expression. The discovery also provides the 
basis for siRNA compounds that uniquely target a target RNA and siRNA compounds 
that are functional siRNA compounds despite lacking complete complementarity to a 
target RNA sequences. 

As used herein, the term "uniquely targeting siRNA" is meant to refer to an 
siRNA which will only be functional as part of a RISC to eliminate a target RNA and 
will not eliminate a different RNA present, such as a different RNA from the same 
species as that of the target RNA. 

An siRNA will be functional as part of a RISC, i.e. it will eliminate a target 
RNA, if eleven consecutive nucleotides of its sequence, including the third nucleotide 
from the 5' end are completely complementary to those of the target RNA. 

An siRNA sequence can be 18-25 nucleotides, and preferably about 22 
nucleotides. An siRNA is generated by the processing of double stranded RNA 
(dsRNA) sequences of a larger RNA molecule by the enzyme DICER into siRNA 
duplexes which are short double stranded RNAs with each strand having 18-25 
nucleotides. Generally, the siRNA duplex has a two nucleotide overhang on each 
end. The siRNA together with certain proteins forms a ribonucleoprotein complex 
which is referred to as a RISC. The siRNA ceases to be a duplex in an active RISC. 
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When the single stranded siRNA of the RISC hybridizes with a target RNA, the RISC 
functions to degrade the target RNA. 

The degree of complementarity between a single stranded siRNA of the RISC 
and a target RNA required for the RISC to degrade the target RNA is less than 
complete. As used herein <6 the rule" is meant to refer to the requirement that a single 
' stranded siRNA of the RISC have a sequence of at least eleven consecutive 
nucleotides that are fully complementary with an eleven nucleotide sequence of the 
target RNA and that the at least eleven consecutive nucleotides of the siRNA include 
the third nucleotide from the 5* end of the siRNA. Accordingly, for a siRNA with 22 
nucleotides, to fit the rule, full complementarity of the siRNA sequence with a target 
RNA sequence of the following siRNA nucleotides is sufficient to produce a 
functional RISC: 1-11, 1-12, 1-13, 1-14, 1-15, 1-16, 1-17, 1-18, 1-19, 1-20, 1-21, 1- 
22, 2-12, 2-13, 2-14, 2-15, 2-16, 2-17, 2-18, 2-19, 2-20, 2-2L 2-22, 3-13, 3-14, 3-15, 
3-16, 3-17, 3-18, 3-19, 3-20, 3-21 and 3-22. For a siRNA with 18 nucleotides, to fit 
the rule, full complementarity of the siRNA sequence with a target RNA sequence of 
the following siRNA nucleotides is sufficient to produce a functional RISC: 1-1 1, 1- 
12, 1-13, 1-14, 1-15, 1-16, 1-17, 1-18, 2-12, 2-13, 2-14, 2-15, 2-16, 2-17, 2-18, 3-13, 
3-14, 3-15, 3-16, 3-17 and 3-18. For a siRNA with 19 nucleotides, to fit the rule, full 
complementarity of the siRNA sequence with a target RNA sequence of the following 
siRNA nucleotides is sufficient to produce a functional RISC: 1-1 1, 1-12, 1-13, 1-14, 
1-15, 1-16, 1-17, 1-18, 1-19, 2-12, 2-13, 2-14, 2-15, 2-16, 2-17, 2-18, 2-19, 3-13, 3- 
14, 3-15, 3-16, 3-17, 3-18 and 3-19. For a siRNA with 20 nucleotides, to fit the rule, 
full complementarity of the siRNA sequence with a target RNA sequence of the 
following siRNA nucleotides is sufficient to produce a functional RISC: 1-1 1, 1-12, 
1-13, 1-14, 1-15, 1-16, 1-17, 1-18, 1-19, 1-20, 2-13, 2-14, 2-15, 2-16, 2-17, 2-18, 2- 
19, 2-20, 3-13, 3-14, 3-15, 3-16, 3-17, 3-18, 3-19, and 3-20. For a siRNA with 21 
nucleotides, to fit the rule, full complementarity of the siRNA sequence with a target 
RNA sequence of the following siRNA nucleotides is sufficient to produce a 
functional RISC: 1-11, 1-12, 1-13, 1-14, 1-15, 1-16, 1-17, 1-18, 1-19, 1-20, 1-21, 2- 
12, 2-13, 2-14, 2-15, 2-16, 2-17, 2-18, 2-19, 2-20, 2-21, 3-13, 3-14, 3-15, 3-16, 3-17, 
3-18, 3-19, 3-20, and 3-21. For a siRNA with 23 nucleotides, to fit the rule, full 
complementarity of the siRNA sequence with a target RNA sequence of the following 
siRNA nucleotides is sufficient to produce a functional RISC: 1-1 1, 1-12, 1-13, 1-14, 
1-15, 1-16, 1-17, 1-18, 1-19, 1-20, 1-21, 1-22, 1-23, 2-12, 2-13, 2-14, 2-15, 2-16, 2- 
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17, 2-18, 2-19, 2-20, 2-21, 2-22, 2-23, 3-13, 3-14, 3-15, 3-16, 3-17, 3-18, 3-19, 3-20, 
3-21, 3-22 and 3-23. For a siRNA with 24 nucleotides, to fit the rule, full 
complementarity of the siRNA sequence with a target RNA sequence of the following 
siRNA nucleotides is sufficient to produce a functional RISC: 1-1 1, 1-12, 1-13, 1-14, 
1-15, 1-16, 1-17, 1-18, 1-19, 1-20, 1-21, 1-22, 1-23, 1-24, 2-12, 2-13, 2-14, 2-15, 2- 
16, 2-17, 2-18, 2-19, 2-20, 2-21, 2-22, 2-23, 2-24, 3-13, 3-14, 3-15, 3-16, 3-17, 3-18, 
3-19, 3-20, 3-21, 3-22, 3-23 and 3-24. For a siRNA with 25 nucleotides, to fit the 
rule, full complementarity of the siRNA sequence with a target RNA sequence of the 
following siRNA nucleotides is sufficient to produce a functional RISC: 1-11,1-12, 
1-13, 1-14, 1-15, 1-16, 1-17, 1-18, 1-19, 1-20, 1-21, 1-22, 1-23, 1-24, 1-25, 2-12, 2- 
13, 2-14, 2-15, 2-16, 2-17, 2-18, 2-19, 2-20, 2-21, 2-22, 2-23, 2-24, 2-25, 3-13, 3-14, 
3-15, 3-16, 3-17, 3-18, 3-19, 3-20, 3-21, 3-22, 3-23, 3-24 and 3-25. Those where the 
complementary begins at nucleotide four from the 5' end or further 5' will not 
function, i.e. they do not fit the rule. 

It is possible to determine whether or not a given sequence fits the rule for 
known RNA sequences. In a preferred embodiment, this can be done in an automated 
fashion using software which is or can be programmed to test for the rule using a 
given sequence and a database of known RNA sequences. Generally, the database is 
to test a sequence against known RNA sequences from the same species. In some 
embodiments, the search algorithm is the BLAST algorithm which is readily 
obtainable and in the public domain. In some embodiments, the mRNA databases are 
selected from the database deposited with NCBI and the database deposited with 
ENSEMBL. Both databases are readily accessible and in the public domain. 

In some embodiments, a particular siRNA sequence or miRNA sequence is 
known and compared with RNA sequences to determine if the siRNA sequence will 
function with an RNA other than the target RNA, i.e. determine whether it uniquely 
targets the target RNA or if it will function against non-target RNA as well. In the 
case of testing an siRNA sequence, this information provides insight into whether or 
not a given siRNA will function beyond processing the desired target RNA. In the 
case of testing an miRNA sequence, this information provides insight into whether or 
not a given miRNA can function as an active siRNA against a target beyond 
functioning to inhibit translation of the desired target RNA. In either case, the RNAi 
sequence can be compared with known RNA sequences, preferably using a computer 
[program to test against a database of RNA sequences, to determine if any unintended 
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activity will result. The ability to test for predicted unintended activity allows for a 
number of candidate siRNA or miRNA sequences to be analyzed to select for those 
which will not have unintended activity. 

In some embodiments, a particular RNA sequence can be identified as one to 
be targeted for siRNA mediated degradation and using the target RNA sequence a 
uniquely targeted siRNA sequence can be designed. Sequences of 18-25 nucleotides 
can be tested to determine if those which fit the rule will not have unintended activity. 
Importantly, while not limited to eleven consecutive nucleotides including the third 
from the 5* end, only those eleven must have full complementarity. The remaining 7- 
14 nucleotides can independently be complementary or not. A computer can be 
programmed to use the sequence of the target RNA to generate nucleic acid sequences 
of 18-25 which fit the rule and are unique when compared to a database of known 
RNA sequences. In some embodiments, such a program may initially test 18-25 
nucleotide fragments of the target RNA to determine such fragments with full 
complementarity to the target RNA fit the rule for any non-target RNA sequences. In 
some embodiments, such a program may sequences of 18-25 nucleotides that fit the 
rule for the target RNA and determine if such sequences fit the rule for any non-target 
RNA sequences. 

Once a sequence has been determined to be a uniquely targeting siRNA for a 
particular target RNA, the sequence can be used to synthesize an active siRNA 
system. Commercially available kits may be employed to generate an active siRNA 
system for a given siRNA sequence. Such kits may be obtained from various 
commercial sources including Ambion, Inc. (Austin, TX); IMGENEX Corporation 
(San Diego, CA), InvivoGen (San Diego, CA); Minis (Madison, WI), New England 
Biolabs (Beverly, MA); Promega (Madison, WI); QIAGEN (Venlo, the Netherlands); 
and Spring Bioscience (Fremont, CA). Similarly, once it has been determined that an 
miRNA sequence will not target an RNA as a functional siRNA, the miRNA can be 
synthesized. 

Uniquely targeting siRNA for a particular target RNA may be used to inhibit 
expression of the target gene by eliminating the target mRNA, thereby preventing 
prior to translation and protein production. Functional miRNA for a particular target 
RNA may be used to inhibit expression of the target gene by interfering with 
translation of the target mRNA, and thus protein production. When the miRNA 



-10- 



WO 2005/042705 



PCT/US2004/035124 



sequence cannot generate an active siRNA sequence no unintended siRNA activity 
will be associated with use of the miRNA. 

In some embodiments, a particular siRNA sequence or miRNA sequence is 
known and compared with RNA sequences to determine if the siRNA sequence will 
function with an RNA other than the target RNA, i.e. determine whether it uniquely 
targets the target RNA or if it will function against non-target RNA as well. In the 
case of testing an siRNA sequence, this information provides insight into whether or 
not a given siRNA will function beyond processing the desired target RNA. In the 
case of testing an miRNA sequence, this information provides insight into whether or 
not a given miRNA can function as an active siRNA against a target beyond 
functioning to inhibit translation of the desired target RNA. In either case, the RNAi 
sequence can be compared with known RNA sequences, preferably using a computer 
[program to test against a database of RNA sequences, to determine if any unintended 
activity will result. The ability to test for predicted unintended activity allows for a 
number of candidate siRNA or miRNA sequences to be analyzed to select for those 
which will not have unintended activity. 

In some embodiments, siRNA compounds and RISCs comprising such siRNA 
compounds are provided in which the siRNA sequence is a uniquely targeting siRNA 
sequence. In some preferred embodiments, the uniquely targeting siRNA and the 
RISCs comprising the same are directed against RNA encoding oncogenes, tumor 
suppressor genes, genes causing neurogenerative diseases, viral genes, cytokines, 
interleukins, chemokines, co-stimulatory molecules and growth factors. In some 
preferred embodiments, the uniquely targeting siRNA and the RISCs comprising the 
same are directed against RNA encoding p53, MYC, MYB, PTEN, HER-2/NEU, 
TGF, PDGFR, Ras, Rb (Retinoblastoma protein), bcl-2, bax, phosphatase 2A, 
telomerase, pl6, p21, Cyclin Dependent Kinases (CDKs), TRAIL, TNF, Mitogen 
Activated Kinases (MAPKs), VHL, NfkB, Merlin, Schwanomin, Notch, APC, 
AXIN2, E-Cadherin, GPC3, AXT1,2. Patched, SUFU, FH, SDHB,C,D, WT-1, 
STK1 1, TSC1, TSC2, BMPR1 A, SMAD4, NF2, BHD, HRPT2, MUTYH, ATM, 
BLM, BRCA1, BRCA2, FANCA,C,D2,E,F,G, MSH2, MLH1, MSH6, PMS2, XPA, 
KT, MET, RET, HIF-1, b-catenin, FAS, FBXW7, GLI, HPEVE6, MDM2, AKT2, 
F0XOlA,3A, PI3KCA, CYCLIN Dl, HPVE7, TALI, TFE3, ALK, ABL1, BRAF, 
EGFR, EPHB2, ERBB2, FES, FGFRs, FLT3,4, JAK2, N-RAS, K-RAS, PDGFB, 
PDGFRB, EWSR1, RUNX1, SMAD2, TGFFBR1, 2, MYCN, HMGA2, 
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HOXA9,l 1,13, HOXll,llL2, MKK4, MLL, SS18, RARA, PTNP1,11, APP 
(Amyloid Precursor protein), Tau, synucleins, alpha-synuclein (PARK1), Parkin 
(PARK2), PARKS, Huntingtin, SOD (Superoxide Dismutase), Senataxin, 
Apolipoprotein-E, DJ-1 (PARL7), presenilin-1, preseniIin-2, IL-1, IL-2, IL-4. TNF, 
Interferons, (EFNA, IFNB, IFN-gamma), CD4, CD8, CD19, CDS, CD43, CD1, CD3, 
B-cell receptor, T-cell Receptor, VEGF, bcr/abl, HTV genes such asrev, env, gag, or 
CMV genes. 

In some embodiments, siRNA compounds and RISCs comprising siRNA 
compounds are provided in which the siRNA sequence is less than fully 
complementary with a target RNA sequence. These siRNA sequences fit the rule but 
are less than completely complementary. In some embodiments, 11, 12, 13, 14, 15, 
16, 17 or 18, (including the third from the 5' end) of 18 nucleotides are 
complementary with a sequence of the target RNA. In some embodiments, 1 1, 12, 13, 
14, 15, 16, 17, 18 or 19 (including the third from the 5' end) of 19 nucleotides are 
complementary with a sequence of the target RNA. In some embodiments, 11, 12, 
13, 14, 15, 16, 17, 18, 19 or 20 (including the third from the 5 9 end) of 20 nucleotides 
are complementary with a sequence of the target RNA. In some embodiments, 1 1, 
12, 13, 14, 15, 16, 17, 18, 19, 20 or 21 (including the third from the 5> end) of 21 
nucleotides are complementary with a sequence of the target RNA. In some 
embodiments, 1 1, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21 or 22 (including the third from 
the 5' end) of 22 nucleotides are complementary with a sequence of the target RNA. 
In some embodiments, 1 1, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22 or 23 (including 
the third from the 5' end) of 23 nucleotides are complementary with a sequence of the 
target RNA. In some embodiments, 1 1, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23 
or 24 (including the third from the 5' end) of 24 nucleotides are complementary with 
a sequence of the target RNA. In some embodiments, 1 1, 12, 13, 14, 15, 16, 17, 18, 
19, 20, 21, 22, 23, 24, or 25 (including the third from the 5' end) of 25 nucleotides are 
complementary with a sequence of the target RNA. In some embodiments, they are 
uniquely targeting siRNA sequences. In some embodiments, siRNA compounds and 
RISCs comprising such siRNA compounds are provided in which the siRNA 
sequence is a uniquely targeting siRNA sequence. In some preferred embodiments, 
the uniquely targeting siRNA and the RISCs comprising the same are directed against 
RNA encoding oncogenes, tumor suppressor genes, genes causing neurogenerative 
diseases, viral genes, cytokines, interleukins, chemokines, co-stimulatory molecules 
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and growth factors. In some preferred embodiments, the uniquely targeting siRNA 
and the RISCs comprising the same are directed against RNA encoding p53, MYC, 
MYB, PTEN, HER-2/NEU, TGF, PDGFR, Ras, Rb (Retinoblastoma protein), bcl-2, 
bax, phosphatase 2A, telomerase, pi 6, p21, Cyclin Dependent Kinases (CDKs), 
TRAIL, TNF, Mitogen Activated Kinases (MAPKs), VHL, NfkB, Merlin, 
Schwanomin, Notch, APC, AXIN2, E-Cadherin, QPC3, AXT1,2. Patched, SUFU, 
FH, SDHB,C,D, WT-1, STK1 1, TSC1, TSC2, BMPR1A, SMAD4, NF2, BHD, 
HRPT2, MUTYH, ATM, BLM, BRCA1, BRCA2, FANCA,C,D2,E,F,G, MSH2, 
MLH1, MSH6, PMS2, XPA, KT, MET, RET, HIF-1, b-catenin, FAS, FBXW7, GLI, 
HPEVE6, MDM2, AKT2, FOXOl A,3A, PI3KCA, CYCLIN Dl, HPVE7, TALI, 
TFE3, ALK, ABL1, BRAF, EGFR, EPHB2, ERBB2, FES, FGFRs, FLT3,4, JAK2, 
N-RAS, K-RAS, PDGFB, PDGFRB, EWSR1, RUNX1, SMAD2, TGFFBR1, 2, 
MYCN, HMGA2, HOXA9,ll,13, HOX11J1L2, MKK4, MLL, SS18, RARA, 
PTNP1,1 1, APP (Amyloid Precursor protein), Tau, synucleins, alpha-synuclein 
(PARK1), Parkin (PARK2), PARKS, Huntingtin, SOD (Superoxide Dismutase), 
Senataxin, Apolipoprotein-E, DJ-1 (PARL7), presenilin-1, presenilin-2, IL-1, EL-2, 
IL-4. TNF, Interferons, (IFNA, IFNB, IFN-gamma), CD4, CD8, CD19, CD5, CD43, 
CD1, CD3, B-cell receptor, T-cell Receptor, VEGF, bcr/abl, HIV genes such asrev, 
em, gag, or CMV genes. 

In preferred embodiments, the siRNAs and miRNAs described, designed, 
synthesized and used herein are active in the inhibition of RNA translation in animals, 
preferably humans, canines, felines, equines, bovines, porcines, ovines, avian species, 
and fish. In other embodiments, the siRNAs or miRNAs described, designed, 
synthesized and used herein are active in the inhibition of RNA translation in plants 
such as corn, soybean, maize, wheat, barley, sorghum, cotton, oilseed, Arabidopsis 
thaliana, aspen, pines, oak, legumes, tubers, fruits such as apples, oranges, lemons, 
limes, pears, pineapples, grapes, tomatos, peppers, squash, cucumber, melons and 
vegetables such as celery, broccoli, cauliflower, spinach, and carrots. 

EXAMPLE 

Bacterially produced proteins were expressed in BL21-DE3 CodonPlus cells 
(Stratagene) and were purified according to the instructions of the manufacturers. 
Untagged human eIF2C2 protein, His-tagged eIF2C2 (eIF2C2-His) and a fusion 
protein of eIF2C2 with Glutathione-S-Transferase (GST-eIF2C2) were expressed 
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from pTYB2 (NEB), pET28 (Novagen) and pGEX-6P2 (Amersham-Pharmacia) 
vectors respectively. Untagged or eIF2C2-His proteins were purified from the 
insoluble fraction and dialyzed gradually using decreasing amounts of Urea as per 
Novagen's instructions. GST-eIF2C2 protein retained solubility but was often 
degraded during the final purification steps which included either elution with 
glutathione or proteolytic release of eIF2C2 from GST with PreScission protease 
(Amersham-Pharmacia). All bacterially produced proteins were unable to bind 
synthetic RNAs corresponding either to single stranded let-7 (either with or without a 
5 '-phosphate), or to let-7 prepared as an siRNA duplex, and were devoid of 
endonuclease activity (data not shown). 

Protein-G purified 8C7 mAb against eEF2C2 or non immune mouse IgG were 
covalently cross-linked on agarose beads using the Seize Primary 
Immunoprecipitation Kit (Pierce) according to the manufacturer's instructions. The 
beads were subsequently treated with 0.1M Glycine pH 3.0 to remove free antibody 
and were washed with lysis buffer (20mM Tris-HCL, pH 7.4, 200mM NaCl, 2.5 mM 
MgC12). Total cell lysate prepared from 10 8 HeLa cells in lysis buffer supplemented 
with Complete Protease Inhibitors (Roche) and lOU/ml Rnasin (Promega) was 
applied to the 8C7 or to the IgG beads, as described in (3). After antigen capture, the 
beads were washed extensively with lysis buffer containing 500mM NaCl followed 
by washes in buffer D (20mM Hepes-KOH, pH 7.4, lOOmM KCL, O.lmM EDTA). 
Captured proteins were eluted by treating the beads with 2M Urea dissolved in buffer 
D; the eluate was immediately dialyzed against buffer D and was concentrated using 
Amicon devices (Microcon)./The final, concentrated eluate from the 8C7 beads 
contained approx. 4\ig of purified eIF2C2 protein. Proteins were resolved on 
NuPAGE (Invitrogen) gels and stained with Silver Stain or with SimpIyBlue 
SafeStain (Invitrogen). 

RNA was isolated from 0.5^g (5 pmoles) of immunopurified eEF2C2 protein, 
fractionated on 15% denaturing polyacrylamide gel along with known amounts of 
synthetic let-7 RNA (SEQ ID NO:l S'-UGAGGUAGUAGGUUGUAUAGU-S'; 
Dharmacon), transferred to a Hybond-N+ membrane (Amersham-Pharmacia) and 
hybridized with a 5' -end 32P-labeled DNA oligonucleotide probe complementary to 
let-7 (SEQ ID NO:2 5 , -ACTATACAACCTACTACCTCA-3 , ). 

All RNAs were chemically synthesized (Dharmacon). let7-TP and let7-TI 
contained a 5'-Biotin and were 3 '-end labeled with [5'- 32 P]-pCp and T4 RNA ligase 
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(NEB). All other RNAs were 5'-end labeled with [y- P]-ATP and T4 polynucleotide 
kinase (NEB). Cleavage assays were performed in a buffer containing 20mM Hepes- 
KOH, pH 7.4, 40mM KCL, 40mM Potassium Acetate, 2 mM MgC12, ImM DTT, 
10U/ml Rnasin, at 37° C, for 60 minutes or as indicated in the Figure legends. In 
some assays (Figure 2D), ImM ATP was included or MgC12, was omitted from the 
reactions. RNA was isolated from cleavage reactions and resolved in denaturing 
polyacrylamide gels. 

Since Argonaute proteins are essential for RNAi (22) (24) (25) and are the 
only protein components that miKNPs (3) and Drosophila (10) and human (26) RISCs 
have in common, it an Argonaute protein bound to mi/siRNAs may be the elusive 
miRNP/RISC endonuclease. To test this hypothesis the human eIF2C2 protein was 
studied because it is present both in human miRNPs (3) and human RISCs (26). 
Recombinant human eIF2C2 protein containing a single miRNA species was 
generated by expressing human eIF2C2 protein in bacteria (which do not contain 
miRNAs), and loading the purified, recombinant eIF2C2 protein with synthetic 
RNAs. However, all attempts to produce active recombinant eIF2C2 protein, using 
multiple systems, failed (31). eIF2C2 protein is tightly bound to miRNAs (see below) 
and it is very likely that proper folding and incorporation of miRNAs with elF2C2 can 
occi/r only in cells from organisms that contain miRNAs and miRNA-associated 
machinery. Expression of human elF2C2 protein in a heterologous eukaryotic 
expression system (such as the baculovirus-based, insect-cell system) should in 
principle allow for proper elF2C2 folding and miRNA incorporation. This approach, 
however, is complicated by the presence of numerous Argonaute protein paralogs 
within each organism (21) that may interfere with overexpression of exogenous, 
human eDF2C2 protein. For this reason and because knowledge about the biogenesis 
of eIF2C2-miRNA ribonucleoproteins is lacking, endogenous human elF2C2 was 
immunopurified from HeLa cells using monoclonal antibody (mAb) 8C7 which is 
described in (3), under conditions in which Dicer or the other protein components of 
miRNPs did not copurify (32). As shown in Figure 1 A, the 8C7 eluate contained a 
single protein with an apparent molecular mass of -95 KDa while the eluate from the 
negative control, non-immune mouse IgG contained no detectable proteins. 
Microsequencing by nanoelectrospray mass spectrometry confirmed the identity of 
the protein present in the 8C7 eluate as eBP2C2. 
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To test for the presence of miRNAs, RNA was isolated from total HeLa cell 
lysate, from lOOng of immunopurified elF2C2 and from an equivalent volume of 
control IgG eluate; the RNA was 3'-end-labeled with [5'- 32 P]-pCp and an aliquot was 
analyzed by denaturing polyacrylamide electrophoresis. As shown in Figure IB, 
~22nt RNAs corresponding to microRNAs are present only in the lane containing 
RNA from purified eIF2C2 (8C7 eluate). A much less abundant RNA band at ~80nt, 
present also in the IgG eluate, most likely represents traces of tRNA. These results 
show that miRNAs are stably bound to eIF2C2 and resist high salt washes (500mM 
NaCl) and Urea treatment. Numerous miRNAs associate with eIF2C2 containing 
miRNPs as single stranded forms (3). Similarly, siRNAs are found as single stranded 
molecules in purified human RISCs, which also contain eIF2C2 and eDF2Cl (26). 
Likewise, the immunopurified eIF2C2 protein is expected to contain a heterogeneous 
population of miRNAs. 

To calculate the amount of let-7a bound to eIF2C2 protein, quantitative 
Northern blot analysis was performed (33). As shown in Figure 1C/1D, 5 pmoles 
(0.5p,g) of purified eIF2C2 protein contains -0.1 pmoles of Iet-7a 9 showing that -2% 
of purified eIF2C2 is complexed with let-7a\ the remainder of the protein is most 
likely complexed with other miRNAs, although the possibility that a fraction of 
eEF2C2 protein is devoid of miRNAs cannot be exclude. 

To test whether the eluted, immunopurified eEF2C2 protein had microRNA- 
programmed endonuclease activity, the mi/siRNA mediated, in vitro cleavage assay 
developed by Tuschl and colleagues (5) and Zamore and colleagues (27) was used. In 
this assay, perfect complementarity between an si/miRNA and its RNA target leads to 
endonucleolytic cleavage of the RNA target at a site corresponding to the middle of 
the si/miRNA complementary site, while no cleavage is seen when the 
complementarity between the si/miRNA and the RNA target is partial. Two target 
RNA oligonucleotides, one bearing a site with perfect and another with imperfect 
complementarity to let-7a (let7-TP and let7-TI, respectively; Figure 2A), were 3 '-end 
labeled with [5'- 32 P]-pCp and were incubated with eluted, immunopurified eDP2C2 or 
IgG eluate (negative control), in the absence of ATP or other nucleotides (34). As 
shown in Figure 2B, immunopurified eIF2C2 directed the endonucleolytic cleavage of 
let7-TP at a site corresponding to the middle of the let-7a-complementary site, but did 
not cleave Iet7-TT; the IgG eluate showed no endonuclease activity. Next testedwas 
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whether ATP or magnesium ions were required for target RNA cleavage. As shown 
in Figure 2C, addition of ATP did not enhance target RNA cleavage but magnesium 
was required for endonucleolytic cleavage. These findings are consistent with 
previous studies demonstrating that target RNA cleavage does not require ATP (1 1), 
and demonstrate that in vitro, eIF2C2, an Argonaute protein, with its associated 
miRNAs is sufficient for both target RNA recognition and endonucleolytic cleavage. 
The general term miRgonaute is proposed to designate a ribonucleoprotein composed 
of an Argonaute protein complexed with mature miRNAs. Because of the purity of 
the miRNA containing eIF2C2 ribonucleoprotein (Figure 1 A/IB) used in the cleavage 
assays, it is unlikely that the endonuclease activity resides in another protein or RNA 
molecule (other that miRNAs bound to eIF2C2) that has escaped detection. The 
possibility cannot, however, be excluded that another small cofactor that is tightly 
bound to eIF2C2 or the miRNAs is also required for endonucleolytic activity. 

To calculate the rate of target RNA cleavage 1 pmole (lOOng) of purified 
eIF2C2 protein, containing -0.02 pmoles of let-7a miRNA with 0.4 pmoles of 
radiolabeled target RNA, let7-TP was incubated for 5, 30 and 90 minutes in a 
cleavage assay and quantitated the amount of product generated as a function of time. 
As shown in Figure 2C, 0.02 pmoles of let-7 miRNA, complexed with eIF2C2, 
cleaved ~r60% of the target RNA in 90 minutes, indicating that the let-7 containing 
eIF2C2 ribonucleoprotein catalyzed -12 rounds of target RNA cleavage. These 
results are consistent with the finding that in HeLa SI 00 extract miRNAs are 
components of multiple turnover enzymes and are in agreement with the estimated 
-10 rounds of target RNA cleavage mediated by HeLa SI 00 extracts (27). 

To investigate further the requirements for target RNA recognition and 
cleavage by purified eIF2C2/let-7, four additional RNAs were tested in cleavage 
assays: Iet7-TA, a 21-mer with perfect complementarity to let-7; Iet7-TB, a 17-mer 
that base pairs with the last 17 nucleotides (from the 5'-end) of let-7; let7-TC, a 17- 
mer that base pairs with the first 17 nucleotides of let-7; and Iet7-TD, a 21-mer that 
base pairs with let-7, leaving 2nt, 3*-overhangs, and mimicking an siRNA duplex 
(Figure 3A). As shown in Figure 3B, all target RNAs, except Iet7-TB, were cleaved 
by purified eIF2C2/let-7a, at the same position, between the 10th and 1 1th nucleotide 
starting from the 5 '-end of let-7a. These findings demonstrated, surprisingly, that 
miRgonautes may cleave RNAs that are shorter than their miRNAs and prompted us 
to investigate further the requirements for miRNA:target RNA recognition and 
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cleavage. A series of RNA targets with varying binding properties with let-7a were 
synthesized and tested them in cleavage assays. As shown in Figure 3C/3D, point 
mutations of the target RNA that disrupt base pairing with let-7a, had no effect in 
let7a-mediated target RNA cleavage with the exception of targets let7-TU and let7- 
TT. These two RNAs disrupt base pairing with nucleotides 10 and 1 1 of the let-7a 
miRNA, which have been shown to be critical for target RNA cleavage. Surprisingly, 
RNA targets as short as 11 nucleotides (Iet7-TG to Iet7-TM) were still cleaved by 
eIF2C2/let-7a, as long as they hybridized perfectly with the 5'-most half of let-7a 
(Figure 3E/3F). The 10-mer Iet7-TN RNA was not cleaved by eIF2C2/let-7a. An 
RNA target (Iet7-TN) that does not base pair with the first 2 nucleotides of let-7a is 
also cleaved by EIF2C2/let-7a. (Figure 3E/3F). These findings suggest that the first 
(from the 5 '-end) half of an miRNA is sufficient for target RNA recognition and 
cleavage and that even the first two nucleotides of an miRNA are dispensable for 
target RNA recognition and cleavage. Two longer RNA targets that base pair with 
the first 12 or 14 nucleotides of let-7a (Let7-TO and Iet7-TQ) were also tested. As 
shown in Figure 3E/3F, both of these targets were cleaved. Collectively, these 
findings have profound implications for miRNA function and for the use of siRNAs 
for knockdowns. This study shows that only a portion of an miRNA, bound to an 
Argonaute protein, is required for target RNA recognition and cleavage (summarized 
in a schematic form in Figure 4). Up to now computational prediction of miRNA 
targets (for miRNAs that act as siRNAs) has relied on near perfect complementarity 
between the entire nucleotide sequence of an miRNA and its putative mRNA target. 
This strategy ha$ successfully predicted miRNA targets for most plant miRNAs but 
failed to uncover targets for animal miRNAs (28). Some miRNAs may use only a 
portion of their sequence (as defined in this study) to recognize their targets. This is 
consistent with a recent report describing that only the first 13 nucleotides of the 
novel plant miRNA JAW, are required for target RNA recognition and cleavage in 
vivo (31). The second important finding that stems from this study is in the design of 
siRNAs for experimental knockdowns. A recent report has observed off target 
regulation by siRNAs, thus questioning the specificity of siRNAs (32). The critical 
determinant of siRNA specificity (as defined in this study) are the nucleotides shown 
in Figure 4. In order to design siRNAs with high specificity, and to avoid off target 
effects, these nucleotides should not share homology with any mRNA but the one that 
is targeted. 
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In summary, this study defines the nucleotides of an miRNA that are critical 
for target RNA regulation and cleavage and demonstrates that a miRgonaute (an 
Argonaute protein eIF2C2 with its associated miRNA -let-7a-) is a core 
miRNP/RISC endonuclease. It is likely that other Argonaute proteins act as 
mi/siRNA programmed endonucleases. It is also possible that in vivo, other proteins 
may be required for optimal target RNA recognition and/or miRNP/RISC activity. 
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